Dynamic Fractional Resource Scheduling vs. Batch Scheduling
Authors
Abstract
We propose a novel job scheduling approach for homogeneous cluster computing platforms. Its key feature is the use of virtual machine technology to share fractional node resources in a precise and controlled manner. Other VM-based scheduling approaches have focused primarily on technical issues or on extensions to existing batch scheduling systems, while we take a more aggressive approach and seek heuristics that maximize an objective metric correlated with job performance. We derive absolute performance bounds and develop algorithms for the online, non-clairvoyant version of our scheduling problem. We further evaluate these algorithms in simulation against both synthetic and real-world HPC workloads and compare our algorithms to standard batch scheduling approaches. We find that our approach improves over batch scheduling by orders of magnitude in terms of job stretch, while leading to comparable or better resource utilization. Our results demonstrate that virtualization technology coupled with lightweight online scheduling strategies can afford dramatic improvements in performance for executing HPC workloads.

Key-words: cluster, scheduler, virtual machine, vector bin packing, high performance computing, batch scheduling, stretch

inria-00603091, version 1 - 24 Jun 2011

Dynamic and fractional resource scheduling versus batch scheduling (translated from the French title "Ordonnancement dynamique et fractionnaire des ressources versus ordonnancement par batch")

Résumé (translated from the French): We propose a new approach to scheduling applications on homogeneous parallel computers. Its main feature is the use of virtual machines to share fractions of resources in a precise and controlled manner. Existing approaches that use virtual machines have mainly addressed technical problems or the extension of existing batch systems. Our approach is much more aggressive: we seek heuristics that optimize a particular metric. We establish absolute performance bounds and develop algorithms for the online, non-clairvoyant version of our scheduling problem. We evaluate these algorithms through simulations using either synthetic traces or traces from an existing HPC system, and in this way compare our solutions to the most classical batch scheduling algorithms. We show that our approach improves the slowdown factor (stretch) experienced by applications by several orders of magnitude relative to batch systems, while achieving comparable or lower resource utilization. Our results show that combining virtualization techniques with online scheduling strategies can very significantly improve application execution on HPC systems.

Mots-clés (translated): scheduling, virtual machines, vector bin packing, high performance computing, batch scheduling, stretch
Similar resources
Dynamic Resource Management and Job Scheduling for High Performance Computing = Dynamisches Ressourcenmanagement und Job-Scheduling für das Hochleistungsrechnen
Job scheduling and resource management play an essential role in high-performance computing. Supercomputing resources are usually managed by a batch system, which is responsible for the effective mapping of jobs onto resources (i.e., compute nodes). From the system perspective, a batch system must ensure high system utilization and throughput, while from the user perspective it must ensure fas...
Role of batch size in scheduling optimization of flexible manufacturing system using genetic algorithm
Flexible manufacturing system (FMS) readily addresses the dynamic needs of the customers in terms of variety and quality. At present, there is a need to produce a wide range of quality products in limited time span. On-time delivery of customers’ orders is critical in make-to-order (MTO) manufacturing systems. The completion time of the orders depends on several factors including arrival rate, ...
Simultaneous Batching and Scheduling Using Dynamic Decomposition on a Grid
Scheduling problems arise in many applications in process industries. However, despite various efforts to develop efficient scheduling methods, current approaches cannot be used to solve instances of industrial importance in reasonable time frames. The goal of this paper is the development of a dynamic decomposition scheme that exploits the structure of the problem and is well suited for grid c...
An Efficient Bi-objective Genetic Algorithm for the Single Batch-Processing Machine Scheduling Problem with Sequence Dependent Family Setup Time and Non-identical Job Sizes
This paper considers the problem of minimizing make-span and maximum tardiness simultaneously for scheduling jobs under non-identical job sizes, dynamic job arrivals, incompatible job families, and sequence-dependent family setup time on the single batch-processor, where split size of jobs is allowed between batches. At first, a new Mixed Integer Linear Programming (MILP) model is proposed for t...
Job Scheduling for Prime Time vs. Non-Prime Time
Current job scheduling systems for massively parallel machines and Beowulf-class compute clusters support batch scheduling involving two classes of queues: prime time vs. non-prime time. Jobs running in these queue classes must satisfy different criteria with respect to jobsize, runtime, or other resource needs. These constraints are designed to delay big jobs to non-prime time in order to prov...
Journal: CoRR
Volume: abs/1106.4985
Publication date: 2011